Refactor Empirical distribution #308

FreezyLemon · 2024-11-10T22:36:16Z

Nothing here should be breaking. Some of the changes are opinionated code quality improvements.

Two noticeable improvements from this:

Struct size reduced (platform-dependent, but Option<(f64, f64)> is larger than two f64s. Up to 8 bytes on x64)
Avoids BTreeMap::remove inside fn remove if value > 1

Probably also removes a branch in fn add, but I honestly haven't checked the generated ASM or checked via benchmarking. Should I?

codecov · 2024-11-10T22:37:35Z

Codecov Report

Attention: Patch coverage is 97.56098% with 3 lines in your changes missing coverage. Please review.

Project coverage is 93.80%. Comparing base (252d3d7) to head (b604dbb).
Report is 15 commits behind head on master.

Files with missing lines	Patch %	Lines
src/distribution/empirical.rs	97.56%	3 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #308      +/-   ##
==========================================
+ Coverage   93.70%   93.80%   +0.10%     
==========================================
  Files          53       53              
  Lines       11939    12002      +63     
==========================================
+ Hits        11187    11259      +72     
+ Misses        752      743       -9

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

The old code always removed entries and re-inserted them if necessary. The new code will instead modify the value (= data_point count) in-place and only remove the key- value pair from the map if the count would've dropped to zero.

No value of `Infallible` can ever exist, so it is statically proven that `Result<Empirical, Infallible>` can never exist as a `Result::Err` variant. This allows layout optimizations and is arguably a clearer API.

FreezyLemon · 2024-11-11T11:28:41Z

Pushed a breaking change on top of the previous ones. Infallible is a useful type for this exact purpose, statically proves that no Err variant can ever exist of this result and allows a layout optimization (size_of::<Result<Empirical, Infallible>>() == size_of::<Empirical>())

YeungOnion · 2024-12-03T04:30:49Z

It looks good, and the tests were definitely needed for the statistics. Thank you, I appreciate what you do.
TIL there's a way around not using nightly's !!

FreezyLemon added 7 commits November 10, 2024 23:51

test: new tests for Empirical

80ed5f5

refactor: Move NonNan to module, define some API

d3d2dc2

refactor: Use u64 to store Empirical::sum

224c66a

refactor: Separate Empirical::mean_and_var

02a1322

refactor: Rewrite Empirical::cdf and ::sf

60661f3

refactor: Empirical::remove: Try to modify inplace

10d8c5e

The old code always removed entries and re-inserted them if necessary. The new code will instead modify the value (= data_point count) in-place and only remove the key- value pair from the map if the count would've dropped to zero.

refactor: Empirical::add: Use and_modify

35bce3e

FreezyLemon force-pushed the refactor-empirical branch from ee7245d to 35bce3e Compare November 10, 2024 22:52

refactor!: Empirical::new -> Result<_, Infallible>

b604dbb

No value of `Infallible` can ever exist, so it is statically proven that `Result<Empirical, Infallible>` can never exist as a `Result::Err` variant. This allows layout optimizations and is arguably a clearer API.

YeungOnion approved these changes Dec 3, 2024

View reviewed changes

YeungOnion merged commit 748aa55 into statrs-dev:master Dec 3, 2024
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor Empirical distribution #308

Refactor Empirical distribution #308

FreezyLemon commented Nov 10, 2024

codecov bot commented Nov 10, 2024 •

edited

Loading

FreezyLemon commented Nov 11, 2024

YeungOnion commented Dec 3, 2024

Refactor Empirical distribution #308

Refactor Empirical distribution #308

Conversation

FreezyLemon commented Nov 10, 2024

codecov bot commented Nov 10, 2024 • edited Loading

Codecov Report

FreezyLemon commented Nov 11, 2024

YeungOnion commented Dec 3, 2024

codecov bot commented Nov 10, 2024 •

edited

Loading